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Abstract 

In this paper we study /c-noncrossing RNA structures with minimum arc-length 
4 and at most k — 1 mutually crossing bonds. Let (n) denote the number of k- 
noncrossing RNA structures with arc-length > 4 over n vertices. We prove (a) a 
functional equation for the generating function ^2 n>Q T^(n)z n and (b) derive for 
k < 9 the asymptotic formula T^(n) ~ Ck n~^ k ~ 1 ^ 2+ ^ k ~ 1 ^ 2 " 1 7^ n . Furthermore we 
explicitly compute the exponential growth rates 7^ and asymptotic formulas for 
4 < k < 9. 

Keywords: RNA pseudoknot structure, generating function, singularity analysis, 

/c-noncrossing diagram, exponential growth rate. 
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1. Introduction 



RNA pseudoknot structures [21 [28] are a reality. They occur in functional RNA 
(RNAseP [H]), ribosomal RNA [T7] and are conserved in the catalytic core of group 
I introns. Due to the crossings of arcs their theory differs considerably from RNA 
secondary structures. Pseudoknots are inherently noninductive and the standard 
dynamic programming folding paradigm employed for RNA secondary structures 
can only generate particular subclasses of pseudoknot structures [2~T]. Recently the 
concept of fc-noncrossing RNA structures has been introduced p3j. Here the idea is 
that the complexity of the structure is captured by an inherently "local" property: 
the maximal number of mutually crossing bonds. A structure is fc-noncrossing, if 
there exists no k-set of mutually crossings arcs. The locality is in fact of central im- 
portance: point in case are RNA bisecondary structures introduced by P.F. Stadler 
[12J. These structures are constructed as superpositions of two RNA secondary 
structures and correspond to planar 3-noncrossing structures [H]. The planarity 
property is clearly non-local and at present time the generating function for RNA 
bisecondary structures is not known. 

A very intuitive approach to the A;-noncrossing property of RNA molecules is their 
diagram representation [12]. It is obtained by drawing the nucleotide-labels 1, . . . , n 
in increasing order in a horizontal line and drawing the arc-labels (i, j) in the upper 
half-plane, if and only if i and j are paired in the structure, see Figure [TJ We 
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call a diagram fc-noncrossing, if it does not contain k mutually crossing arcs. The 
length of an arc is given by A = j — i and a stack of length a is a sequence 
of "parallel" arcs of the form (i + 1, j — 1), . . . , (i + (er — 1), j — (a — 1))). A 

/c-noncrossing RNA structure is a fc-noncrossing diagram over [n] having minimum 
arc-length A > 1. These structures have been studied in [TH [T5] via a bijection into 
vaccillating tableaux in the context of tangled diagrams [I]. For the enumeration 
of structures with crossing arcs the tableaux-interpretation is non-optional. There 
is, to the best of our knowledge, no way to inductively construct fc-noncrossing 
structures, despite the fact that they are D-finite. 

For RNA secondary structures (2-noncrossing RNA structures), certain combinato- 
rial restrictions, for instance minimum arc-length or stack-size are relatively straight- 
forward to deal with. The combinatorics and prediction of RNA secondary structures 
has been pioneered by Waterman et al. in a series of excellent papers [201 E6J [25j [271 
[TO] . He proved for the number of RNA secondary structures of length n (arc- length 
> 2), T!j (n), the fundamental recursion 

n—3 

(1.1) Jf{n) = Jf{n + Jf{n - 2 - s)jf{s) , 

s=0 

where T^O) = Tj (1) = T^ 2 '(2) = 1. Eq. (11. ip is an immediate consequence 
considering secondary structures as peak-free Motzkin-paths, i.e., peak- free paths 
with up, down and horizontal steps that stay in the upper halfplane, starting at the 
origin and end on the x-axis. The recursion is in particular the key for all asymptotic 
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results since it immediately implies a functional equation for the corresponding 
generating function. This allows the application of Darboux-type theorems [TTJ [23] . 
For the number of secondary structures with minimum arc-length A, T^n), it is 
straightforward to derive 

ra-(A+l) 

(1.2) T^\ n)=T ^\ n - 1)+ T[ X \n-2-s)T[ X \s) . 

s=0 

All asymptotic formulae for secondary structures are of the same type: a square 
root. In other words, the asymptotic behavior is determined by an algebraic branch 

3 

singularity with the subexponential factor n~2. 

The situation changes for /c-noncrossing RNA structures. A different approach has 
to be made, since in lack of functional equations Darboux-type theorems [23], can- 
not be employed. The idea is to analyze the dominant singularities directly, us- 
ing Hankel contours. Singularity analysis has been pioneered by P. Flajolet and 
A.M. Odlyzko [?]. Its basic idea is the construction of the "singular-analogue" of 
the Taylor-expansion. It can be shown that, under certain conditions, there exists 
an approximation, which is locally of the same order as the original function. The 
particular, local approximation allows then to derive the asymptotic form of the 
coefficients. In contrast to the subtraction of singularities-principle [H] the only 
contributions to the contour integral come from segments close to the singularity. 
In our situation all conditions for singularity analysis are satisfied, since the gener- 
ating functions involved are D-finite [22l [30] and D-finite functions have an analytic 
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continuation into any simply-connected domain containing zero. Our approach also 
works for tangled diagrams [5], which represent the combinatorial framework for 
RNA tertiary interactions. Our analysis confirms that the particular singularity- 
type of the generating function of fc-noncrossing RNA structures depends solely on 
the crossing number [Tj)J HE] . While the location of the singularity shifts as a func- 
tion of the arc-length, all subexponential factors remain the same. Furthermore an 
interesting feature is the appearance of logarithms for k = 1 mod 2 in the singular 
expansion. 

Due to biophysical constraints a minimum arc-length of four can be assumed for 
minimum free energy RNA structures. The key objective of this paper is to derive 
and analyze the generating function for fc-noncrossing RNA structures with mini- 
mum arc-length 4, see Table [U Based on our results the next step is to compute 
the subset of canonical structures, i.e. the subset of structures with arc-length > 4, 
having no isolated arcs. While it is straightforward to obtain eq. (11.21) from eq. (II. ip 
considerable complication arises, when considering /c-noncrossing structures with 
arc-length > 3. To understand why, one observes that the number of ways to place 
3-arcs satisfies a new type of recursion, see eq. (13.61) . As a result and in contrast 
to /c-noncrossing structures with minimum arc-length A < 3 the generating function 
Y2 n >o^k\ n ) z ™ turns out to be a sum of two power series (Theorem [2]). The expo- 
nential growth rate can easily be computed via the formula given in Theorem [3], see 
Table [2] and Figure [3 
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The paper is organized as follows: in Section [2] we provide the background on the 
methods used in this paper. In Section [3] we prove a functional equation relating 
RNA structures to /c-noncrossing matchings. We then study the singularity of the 
generating function and obtain the asymptotic formula in Section HI Finally, in 
Section we detail some key ideas instrumental for the proof of Theorem [2J 
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2. Preliminaries 



In this Section we provide some background on the generating functions of k- 
noncrossing matchings [31 [13] and /c-noncrossing RNA structures [Hl[T5]. We denote 
the numbers of /c-noncrossing matchings and RNA structures with arc-length > A by 
/fe(2n) and T^(n), respectively. The former corresponds to /c-noncrossing diagrams 
without isolated points and the latter to /c-noncrossing diagrams with arc- length > A. 
Furthermore, let T^ (n, £) denote the number of /c-noncrossing RNA structures with 
arc-length > A having exactly £ isolated points and Mfc(n) denotes the number of 
partial matchings, or equivalently the number of /c-noncrossing diagrams over [n] 
(i.e. with isolated points and minimum arc- length 1). Pfringsheim's Theorem [23] 
guarantees the existence of a positive real, dominant singularity of J2 n >o zU 
which we denote by In order to get some intuition about the various types of 
diagrams involved, see Figure [2j 

2.1. /c-noncrossing matchings. Our main objective is to discuss some basic prop- 
erties of /fc(2n) and to give an asymptotic formula. Let us recall that a power series 
u(x) is called Z)-finite over the function field K(x) if dim(w, u', . . . )k{x) < oo [22] . 
The generating function of fc-noncrossing matchings satisfies the following identity 
due to Grabiner et al. [5] 



(2.1) 




n>0 



k-l 

i,j=l ) 
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where 

y 2j+r 



(2.2) I r (2z) = 



Z 



7 (r + 7 )! 

denotes the hyperbolic Bessel function of the first kind of order r. Eq. (12.1 j) allows 
to conclude that 



:2.3) F fc ( 2 ) = ^/ fc (2n)^ 

n>0 



is .D-finite. Indeed, the hyperbolic Bessel function [9] itself is .D-finite and .D-finite 
functions form an algebra closed under taking Hadamard products [22] • Therefore 
D-finiteness of F^z) follows from eq. (12.11) . However, beyond the cases k = 2 and 
k — 3, eq. (12. ip does not give directly explicit formulas for /fe(2n) or F k {z). For 
small /c-values asymptotic formulas can be obtained using the approximation of the 
Bessel function 

(2-4) I m (z) =*(^tl£ H(4m 2 - (2t - lf)z~ h + 0{\z\~ H ) 



f 2~KZ \ ^ hit 

^"^ \h=0 t=l 

which holds for — f < arg(z) < | [I]. For arbitrary fc, systematic analysis of the 
determinant det [/;_,,• (2x) — I i+ j(2x)]\^j} =1 by Jin et al. [13] shows for arbitrary k 

(2.5) / fc (2n) ~ c k n -((^-D 2 H-(^i)/2) ( 2 (fc - l)) 2n , c k > . 

In the following we shall denote the dominant singularity of F k (z) by p k = 2 (k-i) • 



9 



2.2. fc-noncrossing RNA structures, /c-noncrossing RNA structures are 
fc-noncrossing diagrams satisfying specific arc-length conditions. The latter induce 
asymmetries (for instance 1-arcs are not preserved) which prohibit enumeration 
using Gessel and Zeilberger's reflection-principle [8] directly (the reflection principle 
implies eq. (12.1 j) ) . For any k > 2 the numbers of /c-noncrossing RNA structures with 
minimum arc-length > 2 are given by [Hj 

(2.6) lf(n) = £(-l) b ( n , Mfc(n " 26) 

and we have [151 



(2.7) Tf(n) ~ cfn^-^-^{lfr n , i 2 l>0 



[2] 

where 7^ is the unique, solution of minimal modulus of z i* z+1 = Pk- For k- 
noncrossing RNA structures with arc-length > 3 we have according to [H] 

(2.8) \/k>2; -rf(n) = ^ (-l) h \(n, b) M k (n - 2b) , 

&<LfJ 

where A(n, 6) denotes the number of way selecting b arcs of length < 2 over n vertices. 
The nonexplicit terms \{n, b) vanish in the functional equation 



,-[3], 

n>0 

,3 \ 2 ™ 



Z — Z 



(2.9) 

n>0 
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Singularity analysis based on eq. (12. 9p eventually allows to derive the asymptotic 
formula 



(2.10) Tt\n) ~ C j3] n -(( fe -l) 2 + ( fe -l)/2) (7 [3] ) -n ) c [3] >0 



[31 3 

where 7^ denotes the unique, minimal positive real solution of 1 _ z _* z $+ z 3_ z i = Pk- 



2.3. Singularity Analysis. Pfringsheim's Theorem [23J guarantees that each power 
series with positive coefficients has a positive real dominant singularity. This sin- 
gularity plays a key role for the asymptotics of the coefficients. In the proof of 
Theorem [3] it will be important to deduce relations between the coefficients from 
functional equations of generating functions. The class of theorems that deal with 
such deductions are called transfer-theorems [7j. One key ingredient in this frame- 
work is a specific domain in which the functions in question are analytic, which is 
"slightly" bigger than their respective radius of convergence. It is tailored for ex- 
tracting the coefficients via Cauchy's integral formula. Details on the method can 
be found in [221 [7] . In case of D-finite functions we have analytic continuation in 
any simply-connected domain containing zero and all prerequisits of singularity 
analysis are met. We use the notation 

(2.11) {f( z ) = O (g(z)) as z — > p} -<=>- s^-rr is bounded as z — > p\ 
The key result used in Theorem E] is 
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Theorem 1. [7] Let f(z),g(z) be D-finite functions with unique dominant singu- 
larity p and suppose 



where C is a constant and \z n ]h(z) denotes the n-th coefficient of the power series 
h(z) at z = 0. 



In this Section we compute the generating function of T k (n), the number of k- 
noncrossing RNA structures with arc-length > 4. Our first result is a technical 
lemma which is instrumental in the proof of Theorem [2] below. The proof of the 
lemma given below is new and uses integral representations [6] instead of dealing 
with the combinatorial coefficients directly. 

Lemma 1. Let z be an indeterminate over C. Then we have the identity of power 
series 



(2.12) 



f(z) = 0(g(z)) as p . 



Then we have 



(2.13) 




3. The generating function 



E M *(^ n = (jh) 5> (2n) (r^) 

n>0 ^ ' n>0 ^ ' 



2/i 



(3.1) 



V \z\ < p k ; 
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Proof. Expressing the combinatorial terms by contour integrals [6] we obtain 



(3.2) 



/ {l+uYu-^^du f k {2m) = ^-& F k {v)v- 2m - l dv 



2ml 2ixi 



\u\=a 



2-ni 



v\=P 



where a, f3 are arbitrary small positive numbers. We derive 



M fc (n) 



and furthermore 
UJn) 



(2iri 
1 



— T 



(1 + u) n u- lm - v F k (v)v- lm - Y dudv 



u\=a,\v\=f3 



{l + u) n - 



■uv 



(2m) 2 J\ u \ =aM=l3 (uv) 2 - 1 



-F k {y)dudv 



(2-Ki) 2 



\v\=f3 



{l + u) n u 

„|=a («+£)(«- I 



-du 



dv 



Since u = - and w = — ^ are the only singularities (poles) enclosed by the particular 
contour, eq. (13. ip implies 



(1 + u) n u 

ju|=a (U + 1)(U - i) 



(iti = 27T2 



(l+M) n M. (l + w) n W 



1 \u= 



I + 



1 \u= 



ixi i 


1 


n 


I 




1 - - 


+ 


1 + - 




V 




f 
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Therefore, for \z\ < flu 



n>0 



Atri 

1 

AlTl 



n>0 



k{V V 



\v\=P 





1" 


n 


1" 


( 


1 - - 


+ 


1 + - 




V 







1 



'\ v \=p v-(v-l)z AmJ lvl=(3 v-(v + l)z 

The first integrand has its unique pole at v — —jz^ and the second at v — 
respectively: 

1 11.1 11 



v — (v — l)z V + 1 — z 
In view of Fk(z) = Fk(—z) we derive 



and 



- (v + l)z v - 1 - z 



E M k(n)z n = 

n>0 

whence the lemma. 



2 Fk 



z 



1 I z 
f 2 k [l-z 



— - — F k (— 
l-z \ 1 -z 



□ 



Before we state the main result of this section, let us introduce some notation. We 

set 



(3.3) 
(3.4) 



u(z) = s/l + Az- Az 2 - 6z 3 + Az A + z t 
-2z 2 + z 3 - 1 + (-iy U (z) 



2(1 - 2z - z 2 + z A ) 
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Note that fj(z) is an algebraic function over the function field K(z), i.e. there exists 
a polynomial with coefficients being polynomials in z for which fj(z) is a root. This 
fact will be important when computing the subexponential factors of the asymptotic 
formula for T^(n). 



Theorem 2. Let k be a positive integer, k > 3 and fi{z) and f2{z) be given by 
eg. flff.^p . Then we have the functional equation 

n>0 ■ /JA y n>0 V 17 iV 77 



Proof. Claim 1. Let A(n, 6) denote the number of ways to place 6 arcs of length < 4 
over [n]. Then we have 

(3.5) T' 4] H = Yl (-l) b Hn,b)M k (n-2b) 

&<LfJ 

and X(n,b) satisfies the recursion 
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(3.6) 

\(n + 2b,b) = 

X(n + 2b-l,b) + X(n + 2b - 4, b - 2) + A(n + 26 - 5, b - 2) + A(n + 2b - 6, 6 - 3) 



-A(n + 26-3,6-l) , 

where A(n, 0) = 1, A(n, 1) = 3n — 6 and n > 26. The proof of Claim 1 is analogous to 
the proof of Theorem 5 in [H] . In order to keep the paper selfcontained we present 
it in Section [U 

The idea is now to relate ^2 n>0 <k\n) z ™ ^° ^ ne P ower series Xln>o Mfc(n) z 11 . For 
this purpose we compute 
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+ l X ( n + 2b-2i,b-i) + 2\{n + 2b - 2i - 1, b - i) + A(n + 2b - 2% - 2, b - i)\ 



n>0 




^(-l) b 2 2fe ^ A(n, 6) M k (n - 2b)z 



b>0 n>2b 



J2(-l) b z 2b ^ A(n + 26, 6) M fc (n) z 



fe>0 n>0 



Interchanging the summations w.r.t. 6 and n we arrive at 



(3.7) 



E T ?W n = E ^(-l)V fe A(n + 26,6) M k (n) z 



n>0 n>0 |_f>>0 
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Now we use the recursion formula for X(n, b). Let 
(3.8) i Pn (z)=Y J Kn + 2b,b) 



z b . 



b>0 



Multiplying in eq. (13. 6p with z b and taking the summation over all b ranging from 
to [ n /2j implies for (p n (z), n — 1, 2 . . . 

(3.9) ( 1 - z 2 - z 3 - -^-) <p n {z) = (z 2 + ^±T\ Vn-l(z) + (-^-) Vn-2{Z) . 



I — Z J \ 1 — Z J \± — Z , 

We make the Ansatz 

(3.10) f{x,y) = ^^A(n,jV J = J>n(*) g . 

n>0 j'<2 ' n>0 

Multiplying in eq. (13. 9p with and taking the summation over all n > leads to 

the partial differential equation 

(3.H) 

2 3 s \d 2 f(x,y) ( 2 x 2 + 1\ df(x,y) ( x 



X X 1 — x J dy 2 \ + 1 — x / <9y \ 1 — x) ^ 
The general solution of eq. (13.111) can be computed by MAPLE and is given by 

f(x,y) = Fi(x) expif^x) ■ y) + F 2 (x) exp(/ 2 (x) ■ y) 
= ^[F 1 {x)f 1 {x) n + F 3 {x)f a {x) n ]!L, 



n>0 
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where F\(x), F2(x) are arbitrary functions and 

2x 2 - x 3 + 1 + u(x) , 2x 2 - x 3 + 1 - w(x) 

(3.12) /^x) = — — — „ ■ f 2 (x) 



2(1 - 2x - x 2 + x A ) ' J v ' 2(1 - 2x - x 2 + x 4 ) ' 
By definition we have f(x,y) = J2 n >ofn(x) ' 7TT and 

(3.13) <p n (x) = F 1 (x)(f 1 (x)) n + F 2 (x)(f 2 (x)) n ■ 

In order to solve eq. (I3.13P it remains to compute Fx(x) and F 2 (x). The key in- 
formation lies in the initial conditions for f(x,y) and tp n (x). Explicitly we have 
f(x, 0) = 1 and fi(x) = A(l, 0) x° = 1, which implies 

F 1 {x) + F 2 (x) = 1 
F 1 (x)f 1 (x) + F 2 (x)f 2 (x) = 1. 



Accordingly we obtain 

(3.14) Fl{x)= M*)-) and f 2{x) - M*)- 1 



f2{x) - fl(x) - / 2 (z) ' 

In view of <£> n (— z 2 ) = ^ 6>0 A(n+ 26, &)(— l) b z 2fc we can express X] n >o T^(^)^ n as 
follows: 

£lf(n)*» = 5>„(-z 2 ) M fc (n) 

n>0 n>0 

= F.i-z 2 ) ^M fc (n) (/i(-^)^) n + F 2 (-z 2 ) £)M fc (n) (/ 2 (-* 2 )*)' 

n>0 n>0 
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Now we use Lemma [TJ 

E M *w* n = (ih) 5> (2n) (r^Y ' 

n>0 ^ ' n>0 ^ ' 

which allows to express ^2 n>0 T^\n)z n via ^ n>0 /fc(2n) z 2n 

E t?w *• = E aw (i^S)) * ■ 

n>0 ^ v 7 n>0 v J v y/ 



1 - zf 2 (-z 2 ) 

• y/v 1 n>0 



□ 



4. ASYMPTOTICS OF RNA PSEUDOKNOT STRUCTURES WITH ARC-LENGTH > 4 

We set 



(4.1) ^(z) 

(4.2) $ 2 {z) 



1-zM-z*) 
zf2(~z 2 ) 



1 - zf 2 (-z 2 ) • 

Note that i?i(;z) and #2(2) are algebraic functions over the function field K(z). 
Theorem 3. Lei k > 3 be a positive integer and pk,^k denote the positive real 



singularities of F k (z) = ^2 n>0 fk(2n)z 2n and ^2 n>0 T^(n) z n , respectively. Th 



en 



the number of k-noncrossing RNA structures with arc-length > 4 is for k < 9 
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asymptotically given by 

(4.3) T^n) ~ ctn-ak-W+O-W) ( 7 -i)» , 

where 7^ zs i/ie unique positive, real solution of the equation $i(z) = p\~. 

Proof. According to Theorem [2] we have the functional equation 



2d 



n>0 ' v y n>0 



Fk(*l«) 



2/ 2 (-z 2 ) 



We consider the functions $i(z), #2 (2) given by eq. ( 14. ip and eq. ( 14.21) . The mappings 
x t— > ~&i(x) and x h-» #2 0*0 ar e strictly monotone and t?i(x) > ^(^c) for $i(x) g]0, |]. 
Furthermore we have Pk < Pa = \, for > 4. We can conclude from this that 
the real, positive dominant singularity, 7^, of ^ n>0 T[5(n) z n , whose existence is 
guaranteed by Pfringsheim's Theorem [23 ] , satisfies 

(4.4) #i(7fc)=Pfc- 

Being a determinant of Bessel functions [9], -^(2) is ZJ-finite. Moreover $1(2;) and 
$2(2) are algebraic over K(z), analytic for \z\ < S, where 7^ < 5 and satisfy $i(0) = 
^2(0) = 0. Therefore the composition Fk($i(z)), i = 1,2, is D-finite [22] and 
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Fk($i(z)) and F^^&z^z)) have singular expansions, respectively. We further observe 
that neither 1 _^ 1 j~^j 2 ) nor j~|fezjg) have a singularity £ with |£| < 7^. Hence if 
C is a dominant singularity of t[5 (n) z n then it is necessarily a singularity of 
or Ffe(i? 2 (^))- As for singularities of ^(^1(2;)) and ^(^2(2;)), we consider 
for k < 9 the ODE satisfied by F fc (z): 

(4.5) q 0ik (z)^F k (z) + q^tfjL-fFkiz) + ■■■ + q e , k (z)F k (z) = , 

where qj,k{z) are polynomials. The key point is now that any dominant singularity 
of F k (z) is contained in the set of roots of qo tk {z) [22]. Computing the ODEs for 4 < 
k < 9 we can therefore conclude that F k (z) has only the two dominant singularities 
p k and -p k . Let S = {( | i?i(C) = Pfe or i? 2 (C) = ~Pfc}- Tnen 7fe is the unique 
S'-element of minimal modulus. We can draw two conclusions: first 

(4.6) [z n ] lf{z) ~ c fc [z n ] FfcO?^)) for some c fc > 

and secondly, 7^ is the unique dominant singularity of ^ n ^ k {n)z n . In view of 
eq. (14. 6p it thus remains to analyze the subexponential factors of the singular ex- 
pansion of F k ( r d 1 (z)) at z — 7fc. Since ~&\{z) is regular at 7^ we are given the 
supercritical case of singularity analysis [7j. In the supercritical case the subex- 
ponential factors of the compositum, F k { , &i{z)) coincide with those of the outer 
function, F k (z). According to [13] we have for arbitrary k 

(4.7) f k (2n) ~ n-^+V) ( Pk ^) 2n 
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and therefore the subexponential factors of Fk(z) = J2 n >o fki^n)z 2n coincide with 
those of F k (-di(z)), i.e. we have 

(4.8) TfH^^W) ( 7fc -i) B 

proving the theorem. □ 



5. Proof of Claim 1 



We recall that the numbers of fc-noncrossing matchings and RNA structures with arc- 
length > A are denoted by f k (2n) and T k (n), respectively. Furthermore, T\ , (n, £) 
denotes the number of fc-noncrossing RNA structures with arc-length > A having 
exactly £ isolated points, and let f k (m,£) denote the number of /c-noncrossing dia- 
grams with I isolated points over m vertices. Let & n ,k(£, ji, 32, h) be the set of all 
/c-noncrossing diagrams having exactly £ isolated points and exactly j\ 1-arcs, 32 
2-arcs and j 3 3-arcs. We set G k (n,£, ji, j 2 , 33) = \%,k(^J^h,h)\- In particular, we 
have G k (n, £, 0, 0, 0) = T^ 4 '(n, £). We observe that Claim 1 is implied (taking the 
sum over all £) by 

(5.1) TtW) = Y, (-l) b Kn,b)f k (n-2b,£) , 

f><L§J 
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where X(n, b) satisfies the recursion 

A(n, b) = X(n - 1, 6) + A(n - 4, 6 - 2) + A(n - 5, b - 2) + A(n - 6, b - 3) 

b 

(5.2) + ^ [A(n -2i,b-i) + 2X(n - 2i - 1, b - i) + X(n - 2i - 2, b - i)] 

i=l 

- X(n-3,b- 1) 

with the initial conditions X(n, 0) = 1, A(n, 1) = 3n — 6 and n > 2b. 

We shall proceed by proving eq. (15. ip . For this purpose, let A(n, bi, b 2 , b 3 ) denote 

the number of ways to select exactly b\ 1-arcs, b 2 2-arcs and b 3 3-arcs over 1, . . . , n 

vertices. 

Claim A. 

(5.3) 

E (f ) (f ) (f )Gk(n,e,j 1 ,j 2 ,j 3 ) = X{n,b 1 ,b 1 M)fk(n-2{b 1 +b2+b 3 ),i). 

3i>bi,fe>b2,j 3 >b3 1 2 3 

The idea is to construct a family 3" of 5f n .fc-diagrams, having £ isolated points and 
at least b\ 1-arcs, b 2 2-arcs and b 3 3-arcs, respectively. We then express l^l via the 
numbers Gk(n,£,ji,j2,js)- We select (a) b\ 1-arcs and b 2 2-arcs and 63 3-arcs and 
(b) an arbitrary /c-noncrossing diagram over the remaining n — 2{bi + 62 + ^3) vertices 
with exactly i isolated points. Let 3 be the family of diagrams obtained in this way. 
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It is straightforward to show that X(n, bi, b 2 , ^3) satisfies the recursion: 
A(n, 61, 62, 63) = 

A(n - 1, 61, b 2 , b 3 ) + A(n - 2, 6i - 1, 6 2 , 63) + A(n - 4, 61 - 1, b 2 , b 3 - 1) 
+ X(n - 5, &!, b 2l b 3 - 2) + A(n - 6, 61 , 6 2 , h - 3) - X(n -3,b 1 ,b 2 - 1, 63) 

6 

+ ^ [2A(n - 2i - 1, fej, 6 2 - 1, 63 - (i - 1)) + A(n - 2i - 2, 61, & 2 , &3 - i)] 

i=l 
6 

+ ^ [X(n - 2z, 61, 6 2 - 2, 63 - (z - 2))] 

i=2 

with the initial conditions A(n, 0, 0, 0) = 1, X(n, 1, 0, 0) = n — 1, X(n, 0, 1, 0) = n — 2, 
A(ra, 0,0,1) =n-3, n > 2b. 

Clearly, each element 9 G 3" is contained in ^ n ,fc(^, ji, J2, J3) for some ji > 61 and 
j 2 > b 2 and j'3 > 63. Indeed, any 1-arc or 2- arc or 3-arc can only cross at most two 
other arcs. Therefore 1-arcs and 2-arcs and 3-arcs cannot be contained in a set of 
more than 3- mutually crossing arcs. As a result, for k > 3 the construction generates 
fc-noncrossing diagrams. Clearly, 9 has exactly I isolated vertices and in step (b) 
we potentially derive additional 1-arcs and 2-arcs and 3-arcs, whence j\ > b\ and 
j 2 > b 2 and j 3 > b 3 , respectively. Next we observe that we have by construction 



|5F| = X(n, 61, b 2 , b 3 ) f k (n - 2(h + b 2 + b 3 ),£) 
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Since any of the fc-noncrossing diagrams over n — 2{bi +62 + ^3) vertices can generate 
additional 1-arcs or 2-arcs or 3-arcs, we consider 

9 r ( ji , J2, J3) = {0 G "J I 6 has exactly j\ 1-arcs, 3% 2-arcs and j'3 3-arcs}. 

Obviously, we then have the partition "5 = U^&^^&^^SF^'i, jz)- Suppose 
9 G J2) J3)) then 6* G ^,^(^5 Ji, 32, h) an d furthermore # occurs with multiplicity 
(h) ibl) ill) * n ^ s i nce by construction any ^-element subset of the ji 1-arcs and 
6 2 -element subset of the j 2 2-arcs and 6 3 -element subset of the j 3 3-arcs is counted 
respectively in 3 r . Therefore we have 

(5.4) TOi, ji,J3)| = (l 1 ) (fy ^(>-,(n. L . h . . , 

and 

PUuji, js) I = A(n, h, 6 2 , h)fk{n - 2(h + b 2 + b 3 ),£) 

jl>&l J2>£>2 J3>b3 

proving Claim A. We next set 

F k (x,y,z) = ^2 ^2 G k( n JJiih,j3)x n y n z ri ■ 

ji>0j 2 >0j 3 >0 
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Taking derivatives we obtain 

111 77161, &2,&3 (-1 \ 



E 

3l>bl,j2>b2,j3>b3 



and accordingly 



^2 G k{n,Z,h,32,h)x n y n z n 

ji>0,j 2 >0,j 3 >0 



E 

bi>0,fe 2 >0,6 3 >0 



E 



Jl>bl J2>&2 ,i3>f>3 

(x-l) 6l (y-l) fe (2-l) b3 



Jl \ f J2 \ ( J3 



Gk{n,t,h,h,h) 



]T A(n, &!, 62, 6 3 )/ fc (n - 2(6! + b 2 + 63), - l) bl (</ - l)* 9 ^ - l)" 3 • 

bi>0,fe 2 >0,6 3 >0 

By construction G(n, £, 0, 0, 0) is the constant term of the F k (x,y,z). That is, the 
number of fc-noncrossing RNA structures with I isolated vertices and no 1-arcs, 
2- arcs and 3- arcs is given by 



(5.5) G(n,£, 0,0,0)= £ (-1) 

6i>0,6 2 >0,6 3 >0 



bi+b2+bs 



\{n, 61, 6 2 , h)f k {n-2{b 1 +b 2 +b 3 ),£) . 



2(i 



We take the sum over all t and derive 



(5.6) 



tL 41 w 



LfJ 

£ (-l) 6l+62+63 A(n,6 1) 6 2 ,fo 3 ) 

&i>0,&2>0,6 3 >0 



71-2(61+62+63) 

f k (n-2(b 1 + b 2 + b 3 ),, 

£=Q 



Setting 



we conclude first 



X(n,b) = £ X(n,b 1 ,b 2 ,b 3 ) 

bi+b 2 +b 3 =b 



£ (-l) b \(n,b)M k (n-2b) 



and second eq. f)5.2p . completing the proof of Claim 1. 
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179 


647 


2397 


9081 


35181 


139307 


563218 



Table 1. The first 15 numbers of 4-noncrossing RNA structures with 
arc-length > 4 



k 4 5 6 7 8 



% l 6.52900 8.64830 10.71759 12.76349 14.79631 

T l * ] (n) c 4 n _ f(77 1 ) n c 5 ra- 18 (7 5 ~ 1 ) n c 6 n- ! f(% 1 ) n c 7 n" 39 (7f 1 ) n c 8 n" 1 t i (7 8 " 1 ) 



Table 2. Exponential growth rates and asymptotic formulas for k- 
noncrossing RNA structures with minimum arc- length > 4. 



1 2 3 4 5 6 7 8 9 10 11 12 13 




FIGURE 1. /c-noncrossing structures: 2- 3- and 4-noncrossing structures 
(top to bottom). Maximal sets of mutually crossing arcs are colored red. 




12345678 12345678 

(a) (b) 




FIGURE 2. Basic diagram types: (a) 3-noncrossing matching (no isolated 
points), (b) 3-noncrossing partial matching (isolated points 4 and 7), (c) 
4-noncrossing RNA structure with arc- length > 3, (d) 3-noncrossing RNA 
structure with arc-length > 4. 
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Figure 3. The ratio r(n) = jf /(n~ 21/2 ^ n ) as a function of n. The 
curve shows that the asymptotic approximation is valid as r(n) ~ 
c 4 w 4.4509 x 10 7 . 



